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Abstract 



o 

. Recent developments in sensing technologies have enabled us to examine the nature of human social 

\ behavior in greater detail. By applying an information theoretic method to the spatiotemporal data 

(— I ' of cell-phone locations, [C. Song et al. Science 327, 1018 (2010)] found that human mobility patterns 

I ' are remarkably predictable. Inspired by their work, we address a similar predictability question in 

o 

O 



a different kind of human social activity: conversation events. The predictability in the sequence of 
one's conversation partners is defined as the degree to which one's next conversation partner can be 
predicted given the current partner. We quantify this predictability by using the mutual information. 



c/3 

^ ■ 

Q_|. We examine the predictability of conversation events for each individual using the longitudinal data of 



face-to-face interactions collected from two company offices in Japan. Each subject wears a name tag 
equipped with an infrared sensor node, and conversation events are marked when signals are exchanged 
between sensor nodes in close proximity. We find that the conversation events are predictable to a 
certain extent; knowing the current partner decreases the uncertainty about the next partner by 28.4% 
on average. Much of the predictability is explained by long-tailed distributions of interevent intervals. 
However, a predictability also exists in the data, apart from the contribution of their long-tailed 
, nature. In addition, an individual's predictability is correlated with the position of the individual in 
\ the static social network derived from the data. Individuals confined in a community - in the sense of 
an abundance of surrounding triangles - tend to have low predictability, and those bridging different 
communities tend to have high predictability. 

PACS numbers: 89.75.Fb, 89.75.Hc, 64.60.aq, 02.50.Ey 
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I. INTRODUCTION 



Recently, interest in the statistical and dynamical features of human social behavior has 
been growing, enabled by the development of new devices that allow tracking of social data 
in real time, with increasing precision and duration jl-9|. A remarkable recent finding from 



the analysis of spatiotempora^ 



are highly predictable 



ay 



data on cell-phone locations is that human mobility patterns 



a finding that is in contrast to the traditional view. For 



instance, in epidemic models that take the mobility of subjects into account, subjects are 



usually assumed to perform a conventional random walk from one location to another |12l. 113 1. 
However, actual traveling patterns of humans often deviate from such random walk models, 
and the displacement distribution follows a power law Furthermore, the statistics of 

the next location of the individual is affected not only by the current location, but also by the 



history o 
patterns 



the traveling pattern, resulting in approximately 90% predictability of the mobility 



io|. 



In this study, we address a similar predictability question for a different component of human 
social behavior: conversation events. Conversation events mediate the spreading and routing of 



diverse contents such as new ideas, opinions, and infectious diseases in social networks |l5l. Il6|. 
In models describing these phenomena, it is a norm that each individual possesses a dynamically 
changing state {e.g., opinion A or opinion B in opinion dynamics, and susceptible or infected 
state in epidemic dynamics). The law of transition from one state to another is usually assumed 
to be Markovian, i.e., independent of the history of the process. The Markovian property, which 
is a type of unpredictability, is an assumption for simulating such dynamics based on a static 



social network 
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16|. 



However, the plausibility of this assumption is unclear. Imagine the office that you share 
with other colleagues in your company. When you have a question about a project, you may 
talk to your boss. After this conversation event, you may tend to talk to a particular individual 
to communicate the instruction of the boss. During lunchtime, you may chat with your close 
colleagues in a particular order that you do not perceive. How predictable is your choice of 
your next conversation partner given the current partner? 

We examine the predictability of conversation events using two sets of longitudinal data 
collected from company offices in Japan. We use the information about the timing and duration 
of conversations between each pair of individuals, but do not use a priori knowledge about status 



or other social attributes of individuals. Our data are unique in that they are collected from a 
relatively high number of individuals {i.e., approximately 200 individuals) over a long recording 
period {i.e., approximately three months). We examine the sequence of conversation events for 
each individual. We find that a conversation event has notable deterministic components. In 
other words, the uncertainty about the next partner that you talk with decreases by 28.4% on 
average, given the identity of the partner you are currently talking with (see Sec. IIII Bp . 

It should be noted that our approach is related to, but different from, the studies of power- 
law interval distributions in conversation events. The interval between successive conversation 
events for an individual or a given pair of individuals often follows a power law [l|, O-fl 



Modeling studies have revealed implications of these empirical results in contagions [4, la. 
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25l ] and opinion formation 26|, |27(|. In contrast to conventional models in which the Poisson 



interval distribution is assumed, these results indicate that the next conversation time given 
the previous one is relatively predictable in that a conversation event in the recent past is a 
precursor to a burst of events in the near future. We argue that the bursty nature of the point 
process largely contributes to the predictability of conversation events. 

We also show that the degree of predictability depends on individuals. Individuals located 
inside a network community, i.e., a dense subnetwork loosely connected to other parts of the 

quantified in this study via strong links and the clustering coefficient, 



entire network 
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behave relatively randomly. On the other hand, individuals that connect different communities 
by weak links tend to have a high predictability. 



II. DATA AND METHODS 



We analyze two sets of face-to-face interaction logs obtained from different company offices 



using the Business Microscope system developed by Hitachi, Ltd., Japan [29|,|30|. The data were 
collected by World Signal Center, Hitachi, Ltd., Japan. Data set Di consists of recordings from 
= 163 individuals for 73 days. Data set D2 consists of recordings from N = 211 individuals 
for 120 days. Each subject wears a name tag strapped around the neck and placed at the 
chest, and each name tag contains an infrared module. The infrared modules can communicate 
with each other if they are less than 3 meters apart. An infrared module only senses the 
modules situated within a 120° circular sector in front of the name tag, and the system detects 
conversation events only when two individuals are facing each other. Communication between 



modules includes exchanging the owners' IDs every 10 sec. We regard two individuals to be 
involved in a conversation event if their infrared modules communicate with each other at least 
once in 1 min. In other words, the time resolution of the system is equal to one minute. The list 
of conversation partners and time stamps is stored in the name tag of each individual and sent to 
the central database on a daily basis. The data transfer occurs when the individual leaves work 



and puts the name tag on a gateway device connected to the individual's computer [29|, |30 |. 
Each data set contains a list of conversation events, as shown in Fig. [T] A conversation event is 
specified by the IDs of the two individuals talking with each other, the date and time at which 
the dialogue starts, and the duration of the dialogue. We are not concerned with the content 
of the dialogue. Data sets Di and D2 contain 51,879 and 125,345 events, respectively. 

We investigate the predictability of each individual's conversation patterns. Our prelim- 
inary data analysis revealed that the timing of conversation events lacks sufficient temporal 
correlation and is unpredictable. Therefore, we neglect the timing of conversation events in the 
data unless otherwise stated and focus on the partner sequence defined as follows. To generate 
the partner sequence of individual 1, we first sift out all the conversation events that involve 
individual 1 from the entire data set (Fig. [^b)). Next, we ignore the time stamp and duration 
of the conversation events. The remaining data define the partner sequence, i.e., the chrono- 
logically ordered sequence of the IDs of the conversation partners for individual 1 (Fig. [II^c)). 
When multiple conversation events involving individual 1 are initiated in the same minute, we 
determine their order at random. 

To evaluate the predictability of the partner sequence, we calculate three entropy measures, 
inspired by those used for the analysis of human mobility patterns [lo|. First, we define the 
random entropy for individual i as 

H^^log.h, (1) 

where ki represents the number of i's partners for the entire recording. If i chooses the part- 
ner with equal probability l/Zcj from all the z's acquaintances in each conversation event, 
quantifies the degree of randomness. Second, we define the uncorrelated entropy as 

^- (2) 

where A/^ is the set of i's partners containing ki elements. Pi{j) represents the probability that 
individual i talks with individual j in a conversation event for i; the normalization is given by 



Ylij^Mi-^i^j) = 1- Compared to H} accounts for the heterogeneity among (j 7^ i). 

Third, we define the conditional entropy as 



j6M ^eM 



where Pj(£|j) represents the conditional probability that individual i talks with individual i 
immediately after talking with individual j. Hf measures the second-order correlation in the 
partner sequence of i. For each individual, < Hf < H} < is satisfied. We quantify the 
predictability of the partner sequence by the mutual information as follows: 



where Pi{i,j) represents the joint probability that individual i talks with individual i immedi- 
ately after talking with individual j. For each individual, < Jj < is satisfied, /j quantifies 
the predictability of the partner sequence; it is equal to the amount of the information about 
the next partner that is earned by knowing the current partner. When the partner sequence 
lacks a second-order correlation such that = Hf, li takes the minimum value 0. In this case, 
knowing the current partner does not help predict the next partner at all. When the partner 
sequence is completely deterministic, i.e., the next partner is completely predicted from the 
current partner such that Hf = 0, /j takes the maximum value H^. 

Although our primary interest in this study is the temporal properties of partner sequences, 
we also analyze the conversation networks (CNs) Gi and G2 constructed by aggregating all the 
conversation events in Di and D2, respectively, over the entire recording. In a CN, the node 
represents an individual, and the weight of the link, denoted as Wij, represents the number of 
conversation events between individuals i and j during the entire recording period. By the 
definition of the conversation event, Wij = wji = 1,2,- ■■ ,N) holds true; the CN is an 
undirected network. The degree ki of individual i is equal to the number of j's for which 
Wij > 0. 

III. RESULTS 

A. Properties of the CN 




We found that both CNs, Gi and G2, are composed of a single connected component. The 
CN Gi is visualized in Fig. [21 we will analyze the relation between the CNs and the predictability 
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in Sec. IIII CI The clustering coefficient 3l| of the unweighted versions o 



to 0.646 and 0.611, respectively. The Pearson assortativity coefficient 



Gi and G2 is equal 



32| of the degree of Gi 



and G2 is equal to 0.169 and 0.296, respectively. Therefore, the CNs have typical properties of 



social networks 



33| . i.e., high clustering and positive assortativity. 



For the two CNs, we measure the distributions of degree, node strength, and link weight 



The node strength Sj is the sum of link weights connecting to node i 
number of conversation events for individual i, defined as 



34 



35| . i.e., the total 



Si 



(5) 



The mean and standard deviation of ki of Gi and G2 are equal to 26.07 ± 11.01 and 69.56 ± 
29.47 (mean ± standard deviation), respectively. Because two individuals are adjacent if there 
is at least one conversation event for a few months, the mean ki of both networks is relatively 
large. Sj of Gi and G^ is equal to 636.6 ± 516.7 and 1188.1 ± 622.1, respectively. Wij of Gi and 
G2 is equal to 24.41 ± 53.69 and 17.08 ±45.77, respectively. The cumulative distribution of the 
three quantities are shown in Fig. |3l 



B. Predictability of partner sequences 

We examine the predictability of partner sequences using the entropy measures. Because the 
estimation of entropy is notoriously biased when the data size is small, we discard individuals 
with less than 100 conversation events {i.e., Si < 100). There remain 146 and 210 individuals 
in data sets Di and D2, respectively after the thresholding. Because the results for the two 
datasets are similar, we report the results for Di in the following. The results for D2 are given 
in Appendix A. 

The histograms of the three types of entropies for partner sequences are shown in Fig. Ht^a). 
For all the individuals, H} is at least 9.94% smaller than H'^ . This implies that individuals 
exhibit a preference when selecting partners from their neighbors in the CN. 

The values of H} and iff for each individual are shown in Fig. lUl^b). The mutual information 
Jj = H} — Hf is positive for all the individuals regardless of the value of H^. In general, 
the finite size effect decreases H} and Hf by different amounts such that the estimated Jj is 



generally inherited with a positive bias 36|. For our data, the positive values of Jj are not an 



artifact caused by the small data size. Through a bootstrap test (see Appendix B for details), 
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we confirmed that the empirical values of li are significantly (at 1% level) larger than the 
values obtained from the bootstrap samples. In short, the bootstrap samples are randomized 
partner sequences that destroy temporal correlation in the data but preserve the original H} 
and account for the portion of Jj derived from the finite size effect. It should also be noted that 
we determined the order of partners at random when conversation events with different partners 
initiate in the same minute. This randomization does not make /j larger because it conserves H} 
and makes larger than the true value. In fact, the Pearson correlation coefficient between Jj 
and the fraction of such overlapping conversation events for individual i (1 < « < A^, Sj > 100) 
is slightly negative {i.e., —0.0811). In summary, the information about the current conversation 
partner gives the information about the next partner; is, on average, 28.4% smaller than 
Hi 

The predictability present in the data is mainly explained by the bursty activity patterns, 
i.e., long-tailed distributions of the interevent intervals, that have been observed for various 
data [l,J-6,8,17-20|. Our data also possess this feature (see Appendix C for details). Because 
the interevent interval for a given pair of individuals obeys a long-tailed distribution, individual i 
tends to talk with individual j again within a short period from their previous conversation. In 
the remainder of this section, we show that the predictability is mainly caused by the bursty 
activity patterns (Fig. EJ^a)) and that predictability also exists in the data even if we omit the 
bursts from the data (Fig. EJ^b)). 

We examine the contribution of the bursty activity pattern to the predictability by calcu- 
lating the mutual information j]^"''*** of the randomized partner sequence. The randomization 
of the interevent intervals between each pair of individuals is realized by swapping interevent 
intervals of the original data within each day in a completely random order (see Appendix D 
for the precise methods). Because of the computational cost of the randomization procedure, 
we obtain the mean and standard deviation of J]^"''^* from 100 randomized partner sequences, 
instead of estimating the confidential interval of The mean l\>^"^^^ accounts for 79.5% 

of the original Jj on average (Fig. El^a)). Because the randomization procedure preserves the 
interevent interval distribution. Fig. El^a) suggests that a large Jj is mainly attributed to the 
bursty activity patterns. It should be noted that I^^"^^^ is large partly because the randomizing 
procedure conserves the timings of the first and last conversation events of each pair on any 
day. Therefore, we may be overestimating the contribution of burstiness to Jj. 

The predictability is not solely determined by the bursty activity patterns. To clarify this 



point, we calculate the mutual information of the modified partner sequence generated 

by merging the consecutive conversation events with the same partner in the original partner 
sequence into one event. This merging procedure allows us to eliminate the contribution of the 
bursty activity pattern to the predictability. For example, if individual i talks with individual j 
3 times without being interrupted by other partners, we merge the three conversation events 
into one. The values of J^'^'^^^ are shown in Fig. El^b). To confirm that the positive values 
of J^^'^^'^ are not an artifact caused by the small data size, we carry out a bootstrap test for 
jmerge gjj^jjg^j, |-q for Jj. By definition, no partner ID appears successively in the merged 
partner sequence. Therefore, we generate the bootstrap sample of the merged partner sequence 
by sampling from the merged sequence with replacement under the condition that the same 
partner is not consecutively chosen (see Appendix D for details). is significantly larger 

than the values obtained from the bootstrap samples. Therefore, the original partner sequence 
possesses some predictability even after removing bursts originating from the bursty nature. 

C. Variation among the predictabilities of individuals 

The predictability, quantified by Jj, depends on individuals. In this section, we investigate 
the relationship between the predictability of individuals and the properties of nodes in the CN. 
The results shown in this section are summarized as follows. First, Jj is negatively correlated 
with node strength Si and with mean node weight defined as Wi = J2jeAfi '^ijl (Fig- El) • Second, 
the CN possesses the "strength of weak ties" structure (Fig. [7](a)). Third, the individuals 
bridging different communities with weak links tend to have large Jj, and those concealed in a 
single community and surrounded by strong links tend to have small Jj (Fig. ID^b)). 

One may speculate that Jj is strongly affected by the node degree ki because = log2 ki 
and and Hf comprise many terms if ki is large. However, ki and Jj are uncorrelated, as 
shown in Fig. El^a). We found that Jj is negatively correlated with Sj (Fig. El^b)) and with 
Wi (Fig. EI^c)). Using the bootstrap test, we verified that the negative correlation shown in 
Fig. Mjo) andint^c) is not because of the finite sampling size (see Appendix B for details). The 
correlation shown in Fig. [6] and the following results do not qualitatively change if we use the 
normalized mutual information js^] h/H} (see Appendix E). We also verified that alternatively 
defining the link weight by the total duration of the conversation events for each pair, instead of 
the total number of the conversation events, does not qualitatively change the results described 
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in this section (see Appendix F for details). 

For a fixed fcj, both Si and Wi decrease with the number of weak hnks {i.e., the hnks with 
small weight) connected to individual i. This fact leads us to hypothesize that individuals 
surrounded by weak links select partners in a relatively deterministic order. According to 
Granovetter's theory of the strength of weak ties, weak links tend to interconnect different 
communities in a social network and bring valuable external information to both end nodes, 
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. Therefore, the individuals bridging 



while strong links tend to be intracommunity links 
different communities with weak links may have large values of /j. 

We first verify the strength of weak ties hypothesis in the CN. The network visualized in 
Fig. [2] appears to be consistent with the hypothesis; weak links tend to connect communities 
composed of strong links. To quantify the extent to which a link is engaged in intracommunity 



connection, we measure the relative neighborhood overlap of a link 39|] , defined as 

0..= l-^-^-^^-l (6) 

where |-| denotes the number of elements in the set. When Oij = 0, individuals i and j do 
not have a common neighbor and the link {i,j) is considered to connect different communities. 
When Oij = 1, individuals i and j share all of the neighbors and the link is confined in 
a community. The strength of weak ties hypothesis suggests that Oij is positively correlated 



with Wij 



39| . In Fig. [Zl^a), Oij averaged over the links with weights smaller than w, denoted as 
{0)w, is plotted against the fraction of links with weights smaller than w, denoted as Pcnm{w). 
Because (0)^ monotonically increases with Pcum(w^); the CN possesses the strength of weak 



ties property, as in the case of mobile communication networks 

Because weak links are associated with a large Jj (Fig. El^c)) and intercommunity links 
(Fig. [Tl^a)), individuals with a large /j are expected to bridge different communities and those 
with a small Jj are expected to be shielded inside a community. This concept is consistent with 
the visual inspection of Fig. |2j To verify this point, we show that is negatively correlated with 
a calibrated clustering coefficient in the following (Fig. [Z(b)). Note that, when the clustering 
coefficient is large, the individual tends to be inside a community quantified by the abundance of 



triangles 



40|. When it is small, the individual tends to connect different communities {41, |42|. 



The clustering coefficient for each node is defined by Ci = (number of triangles including 



individual i)/[fci(fci - l)/2] (0 < Q < 1, i = 1, 2, ■ ■ ■ , iV) |3l|. In Fig. [T^b), the Pearson 



correlation coefficient between /j and Ci{wthr) is plotted against Wthr, where Ci{wthr) is the 



local clustering coefficient Cj for the subgraph of the CN generated by eliminating the links 
with weights smaller than Wthr- We opted to use Cj(tfthr) instead of the weighted clustering 



coefficient defined for weighted networks 



35, 



43| because the latter quantity is, by definition. 



strongly correlated with Si and Wi] we already discussed the negative correlation between Jj 
and Si and between Jj and Wi in Fig. Mjo) and[6]|^c), respectively. For Wthr = 1, -^i and Ci^Wthr) 
are almost uncorrelated. This is because almost all the individuals have a large regardless 
of li in the original CN Gi (refer to Fig. [2] for a visual conffimation of this statement). For 
2 < Wthr < 100, li and Cj(wthr) are negatively correlated (squares in Fig. [Tl^b)). Therefore, an 
individual with a large li tends to bridge different communities as quantified by the clustering 
coefficient. An individual with a small li tends to be confined within communities. The circles 
in Fig. Wljo) represent the partial correlation coefficient between Jj and Ci(wthr), with /cj(wthr) 
and Sj(u'thr) fixed. Here, /ci(u'thr) and Si{wthr) are, respectively, the degree and strength of 
individual i, calculated after eliminating the links with weights smaller than Wthr- Because the 
Pearson and partial correlation coefficients behave similarly, the negative correlation between Jj 
and Ci{wthr) is not ascribed to the negative correlation between li and Si (Fig. [6]|^b)) or between 
li and Wi (Fig. EJ^c)). 



In closing this section, we stress the robustness of our results against observation failures. 
The wearable tag used in our measurement fails to detect a conversation event if the tag is sealed 
behind obstacles such as a desk or partition. For example, suppose that two individuals chat for 
five minutes and either of their tags is just under a desk and is undetected in the third minute. 
Then, the single conversation event is split into two spurious conversation events, each lasting 
for two minutes. To examine the robustness of our results against such observation failures, 
we repeat the same set of analyses after ffiling short intervals between successive conversations 
between the same pair of individuals. If individual i has two successive conversation events with 
individual j and the interval between the two events is smaller than or equal to m minutes, 
we merge the two events into one. The original partner sequence corresponds to m = 0. The 
number of conversation events decreases with m. The interpolation reduces Wij, Si, and Wi and 
conserves ki, Hf, and Cj. We conffimed that our findings are reproduced when we interpolate 
the original data with m = 1 and m = 5 (see Appendix G for details). 
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IV. DISCUSSION 



We have shown that sequences of conversation events have deterministic components. The 
entropy in the distribution of the conversation partners of an individual decreases by, on average, 
28.4% for data set Di and 34.8% for data set D2, if we know the current partner. Much of 
the predictabihty of conversation events results from the bursty activity patterns. In general, 
daily and weekly rhythms of human activity can cause bursty activity patterns 20|. During 



the night and weekend, the individuals are out of the office. Therefore, interevent intervals 
are usually longer than those within working hours. Nevertheless, we consider that the effects 
of such long interevent intervals on the predictability of conversation partners are small. This 
is because the fraction of long interevent intervals, i.e., those over five hours, for example, is 
relatively small, occupying 4.31% in Di and 2.95% in D2. In addition, there is no particular 
reason to believe that the last conversation partner in a day and the first partner in the next 
day are specifically correlated. In this study, we did not correct for the effect of the night and 
weekend. 

The degree of predictability depends on individuals. In particular, we have shown that 
individuals connecting different communities in conversation networks behave relatively deter- 
ministically. We quantified the degree to which an individual is confined in communities by the 
clustering coefficient. In the context of an overlapping community structure, individuals con- 



nect different communities when they belong to multiple overlapping communities |40|. Such 
individuals tend to be surrounded by many triangles if we define the community by 3-cliques 
{i.e., triangles). This apparently contradicts our results. This contradiction comes from the 
difference in what we mean by connecting different communities. We regard individuals as 
bridging different communities when they are not strongly bound to any community and they 
have links to different communities. In this sense, nodes with small clustering coefficient values 
connect different communities in networks with hierarchal structure 



42 



44|. In general, links 



bridging different communities have large betweenness centrality values 45|. The clustering 
coefficient of a node tends to decease with the betweenness centrality 46|. This lends more 
support to our view that individuals with small clustering coefficient values tend to connect 
different network communities. It should be noted that the strength of weak ties property of 
the CN and the relationship between Jj and the individual's position in the CN are preserved, 
if we define the link weight by the total duration of the conversation events for each pair (see 
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Appendix F). 

We do not have an access to the contents of dialogs for ethical reasons. Therefore, the under- 
standing of the reason for the correlation between the individual's position and predictability is 
limited. Nevertheless, individuals that own many weak links and connect distinct groups may 
mediate information flows necessary to coordinate tasks involving these groups {e.g., project 
groups in a company). Such individuals may control the information flow between the groups 
in a rigid manner to yield a large Jj. In contrast, individuals with few weak links may enjoy 
casual (and perhaps creative) conversations within their own groups to choose the partners in 
a random manner. Such individuals may tend to have a small Jj. It should be noted that our 
data were obtained in company offices. Roles or formal positions of individuals in the company 
may affect Jj and the local abundance of weak links surrounding the individuals. 



Song et al. discovered a remarkable predictability in the mobility patterns of humans |10 |. 
In terms of the analysis tools, our methods are similar to theirs. We have applied the entropy 
measures and the concept of predictability to different types of data sets. In our data, the phys- 
ical location of individuals is irrelevant; individuals work in offices in the companies. It should 
be noted that although we have not implemented the prediction algorithm, the predictability 
of the data is implied by the large mutual information that we observed. This logic parallels 
that made for human mobility patterns 10 1. 
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APPENDIX 



A. Results for data set D2 

We obtained qualitatively the same results for D2 as those for Di. The results for D2 are 
shown in Figs. \K2\ IA3[ and IA4t which correspond to Figs. IH El and [7] in the main text, 
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respectively. 



B. Details of the bootstrap test 

To confirm that the large value of the empirically obtained Jj is not because of the small data 
size, we carry out a bootstrap test as follows. First, we make a bootstrap sample of a partner 
sequence with length Si by resampling partners' IDs from the empirical partner sequence of 
individual i without replacement {i.e., shuffling). Then, we use Eq. (j4j) to calculate the mutual 
information Jj for the bootstrap sample. By resampling 5,000 bootstrap partner sequences, we 
construct the distribution of Jj, which we denote as p{Ii). On the basis of p{Ii), we carry out 
a hypothesis test for /j. The null hypothesis of the test is that /j is positive just because of 
the small data size. The alternative hypothesis is that Jj is larger than the value expected for 
unstructured data of a small size. We set the significance level of the test to 1%. Consequently, 
the critical region of the null hypothesis is the half-open interval above the 99 percentile point 
of p{Ii). In Fig. IA5t the results of the bootstrap test are summarized. Apparently, Jj is above 
the 99 percentile point {i.e., the upper end of the each error bar). In fact, for all the individuals 
in Di and D2, except individual 14 in Di and 149 in D2, the null hypothesis is rejected with a 
1% significance level. 



C. Long-tailed behavior of interevent intervals 

Human activity patterns are characterized by long-tailed distributions of the interevent 



intervals jl|, y-|6|, |8|, Il7l-l20|. a feature that is shared by our data. We define the interevent 
interval r as the interval between the initiation time of two successive conversation events 
involving a given individual. The unit of r is a minute, corresponding to the time resolution 
of the recording. As shown in Fig. IA6( a). the distribution of r, denoted by p(r), for a typical 
individual in Di is long-tailed. The tail of the empirical data (solid line) is much fatter than that 
of the exponential distribution whose mean is equal to that of the empirical data (dashed line). 
The histogram of the coefflcient of variation (CV) of p(r) on the basis of all the individuals in 
Di and the same histogram for D2 are shown in Fig. lAGt b). The value of CV is equal to the 
ratio of the standard deviation to the mean and is equal to unity for exponential distribution. 
Figure IA6( b) indicates that the CV of p(r) is much larger than unity for all the individuals. 
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D. Components of the predictability of conversation events 



A possible mechanism governing the predictabihty of the conversation events is the bursty 
activity patterns. To examine the effect of the long-tailed behavior of p{t) on the predictability, 
we carry out a statistical test based on the shuffling of Jj as follows. Consider the sequence 
of conversation events of focal individual i with individual j. If i and j talk four times in a 
given day and the interevent intervals are equal to ri, T2, and Ts in the chronological order, we 
randomize their order. For example, the interevent intervals in the shuffled data are ordered as 
T2, Ti, and Ts. We carry out the same randomization for each day and each partner j. Then, 
we combine the randomized sequences {i.e., point processes) for different j's into the one point 
process from which we read out the randomized partner sequence for i. We define I^^^^^ as the 
mutual information for this randomized partner sequence. In Fig. M^a), the mean and standard 
deviation of J^'"^'^* obtained from 100 randomized partner sequences are shown for different 
individuals in Di. The empirical values of Jj (circles) are significantly larger than j!^"''^* for 
most individuals. However, J^'^^^^ consistently occupies a large fraction of /j and increases with 
/j. Therefore, the burstiness is a major cause of the predictability regardless of the value of /j. 

The burstiness is not the only contributor to the predictability. To show this, we examine the 
reduced partner sequence generated by merging all the consecutive events with the same partner 
into one event. For example, the original partner sequence {2, 3, 3, 6, 4, 4, 3, 3, 3, 2, 6, 2} yields 
the merged partner sequence {2, 3, 6, 4, 3, 2, 6, 2}. We calculate the mutual information in the 
merged partner sequence, denoted by J™'^''^'^. j™erge ^^leasures the predictability of conversation 
events that does not result from the burstiness. We do not directly compare j^^'^^^ with the 
original /j because the merging procedure shortens the length of the partner sequence and the 
amount of mutual information generally depends on the length of a sequence js^]. Instead, 
we carry out a bootstrap test for J^'^^^^_ By definition, the partner changes every time in the 
merged partner sequence. We obtain bootstrap samples respecting this property as follows. 
The frequency with which partner j appears in the merged partner sequence of individual j 
is denoted by p™*^''s*^(j). We select the first partner of i, denoted by i, randomly according to 
pmergc^jy The secoud partner is selected according to P™'^'^^°(j)/(1 — P^'^'^^'^ {£)) , where j 7^ £. 
We repeat the same procedure until the generated sequence becomes as long as the merged 
partner sequence. Figure El^b) summarizes the results of the bootstrap test for jmerge 
consistently larger than the values expected for the bootstrap samples for all the individuals. 
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Therefore, the partner sequence is predictable to some extent even without the effect of the 
bursty activity patterns. 



E. Use of normalized mutual information 

In the field of cluster partitioning, the normalized mutual information Jj = li/H} is used to 
quantify the accuracy of partitioning; methods, because the relationship < Jj < 1 is convenient 



for comparing different methods 37|. Our main results are qualitatively the same if we replace 
/, by/, (Fig.lMD. 



F. Alternative definition of the link weight based on the duration of conversation 

In the main text, we defined the link weight by the total number of conversation events for 
each pair. An alternative definition is given by the total duration of the conversation events for 
each pair. This alternative definition changes Wij, Si, and Wi and conserves ki, H^'^'"^, and /j. 
For the CN where the link weight is defined by the total duration, we repeat the same set of 
analyses as that conducted in Sec. IIII CI As shown in Fig. IA8[ the change in the definition of 
the link weight does not affect our main results. We observed a negative correlation between Jj 
and Si (Fig. lAS^ a)). that between Jj and Wi (Fig. lAS^ b)). the "strength of weak ties" property 
(Fig. IA8( c)). and a negative correlation between J, and Cj(tt'thr) (Fig. IA8( d)). 

G. Robustness against observation failures 

To examine the robustness of our results against observation failures, we analyze the data 
sets after interpolating short intervals between successive conversations between the same pairs 
of individuals. Suppose that individuals i and j talk with each other twice and that i does 
not talk with anybody else between the two conversation events with j. We merge the two 
conversation events into one if the difference between the ending time of the first event and the 
starting time of the second event is less than or equal to m minutes. 

In Fig. \A9\ Si, H}, Hf, and /j, which are the quantities calculated for the data obtained 
with 771 = 1, are compared with Sj, H}, Hf, and /j, respectively. As expected, is smaller than 
Si, and Hi and Hf are generally larger than H^ and Hf, respectively. As shown in Fig. lAlOj 
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the important properties of the data sets are not changed by the interpolation with m = 1. In 
other words, a negative correlation between /j and Sj (Fig. lAlO( a)) and that between Jj and 
Wi (Fig. lAlOT b)). the strength of weak ties property (Fig. lAlOr c)). and a negative correlation 
between Jj and Ci(wthr) (Fig. lAlOr d)) are observed. The results are qualitatively the same for 
m = 5, as shown in Figs. lAllI and IA12[ 
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FIG. 1. Procedure for generating the partner sequence of individual 1. (a) Original data set. (b) List 
of conversation events that involve individual 1. (c) Partner sequence of individual 1. The data set 
shown in (a) is an artificial one, and is provided for the purpose of explanation. 
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FIG. 2. Visualization of CN Gi. For clarity, only the nodes with strengths larger than 100 and the 
links among them are drawn. The darkness of the node color represents the value of /j; a darker node 
has a larger Ij. The thickness of the link is proportional to its weight. The links with weights larger 
than or equal to (smaller than) the median value (i.e., 5) are drawn by red (blue) lines. 
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FIG. 3. Cumulative distribution of (a) degree, (b) node strength, and (c) link weight of the CNs. 
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FIG. 5. Results of the bootstrap test for Di. The circles represent /j and in (a) and (b), 

respectively. The error bars represent the statistics for the bootstrap samples, (a) Results of the 
shuffling test. Ij and the error bars are plotted in the ascending order of /j. The error bars indicate 1 
standard deviation around the mean of 7^"''***^ which was obtained from 100 shuffled partner sequences. 
The ticks at the middle of the error bars indicate the mean, (b) Results of the merging test. 
and the confidential intervals (error bars) are plotted in the ascending order of /™°''s*^. xhe lower and 
upper ends of the error bars represent and 99 percentile points, respectively. The ticks at the middle 
of the error bars indicate the mean. 
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FIG. 6. Mutual information /j is plotted against (a) degree ki, (b) node strength Si, and (c) average 
node weight Wi, for Di. The Pearson correlation coefficient R between the plotted quantities is also 
shown. 
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FIG. 7. (a) Averaged neighborhood overlap {O)^ as a function of the fraction of links with weights 
smaller than w for Di. (b) Pearson correlation coefficient between Jj and Ci{wthT) (squares) and the 
partial correlation coefficient between them with A;j(it;thr) and Si{wthr) fixed (circles), for Di. The 
horizontal line represents zero correlation. 
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FIG. A2. Results of the bootstrap tests for D2 on the basis of (a) shufflmg and (b) merging of the 
partner sequence. See the caption of Fig. [5] for legends. 
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FIG. A3. Mutual information li is plotted against (a) degree /cj, (b) node strength Sj, and (c) average 
node weight Wj, for D2. The Pearson correlation coefficient R between the plotted quantities is also 
shown. 
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FIG. A4. (a) Averaged neighborhood overlap {0)w as a function of the fraction of hnks with weights 
smaller than w for D2- (b) Pearson correlation coefficient between Jj and Ci{wih^) (squares) and the 
partial correlation coefficient between them with ki{wthr) and Si{wthr) fixed (circles), for D2. 
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FIG. A5. Results of the bootstrap test of the finite size effect for (a) Di and (b) D2- h (circles) and 
the confidential intervals (error bars) of individuals are plotted in the ascending order of /j. The lower 
and upper ends of the error bars represent and 99 percentile points, respectively. The ticks at the 
middle of the error bars indicate the mean. 
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FIG. A6. (a) Cumulative distribution of the interevent intervals of a typical individual in Di (solid 
line). The dotted line represents the power-law fit with exponent —1.52, which wa obtained from the 
maximum likelihood test The dashed line represents the exponential distribution with the same 
mean as that of the data, (b) Distributions of the CV of p{t) in Di and D2. 
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FIG. A7. Normalized mutual information /j is plotted against (a) degree /cj, (b) node strength Sj, 
and (c) average node weight uJj, for D\. The Pearson correlation coefficient R between the plotted 
quantities is also shown, (d) Pearson correlation coefficient between /j and C^(^«thr) (squares) and the 
partial correlation coefficient between them with A;i(u;thr) and Sj(u;thr) fixed (circles). 
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FIG. A8. Results when the Unk weight is defined by the total duration of the conversation events for 
each pair, for Di. The mutual information Jj is plotted against (a) node strength Si and (b) average 
node weight Wi. The Pearson correlation coefficient R between the plotted quantities is also shown, (c) 
Averaged neighborhood overlap {0)w as a function of the fraction of links with weights smaller than 
w. (d) Pearson correlation coefficient between /j and Ci(it;thr) (squares) and the partial correlation 
coefficient between them with ki{wthr) and Si{wthr) fixed (circles). 
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FIG. A9. Comparison between the interpolated and original data for data sets Di and D2- (a) 
Node strength, (b) uncorrelated entropy, (c) conditional entropy, and (d) mutual information for the 
interpolated data with m = 1 are plotted against those without interpolation (i.e., original data sets). 
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FIG. AID. Results for the interpolated data with m = 1. We use data set Z?i. The mutual information 
is plotted against (a) node strength and (b) mean weight, (c) Averaged neighborhood overlap (0)-^; 
as a function of the fraction of links with weights smaller than w. (d) Pearson correlation coefficient 
(squares) and partial correlation coefficient (circles) between /j and Ci(w;thr)- 
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FIG. All. Comparison between the interpolated and original data for data sets Di and D2. (a) 
Node strength, (b) uncorrelated entropy, (c) conditional entropy, and (d) mutual information for the 
interpolated data with m = 5 are plotted against those without interpolation (i.e., original data sets). 
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